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TECHNICAL FIELD 

The present invention relates in general to computer searching and in particular to 
generating computer search results that are optimized according to user preferences. 
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BACKGROUND 

In the prior art, search engines typically allow a user to enter a search query in the 
form of one or more keywords. In response to the query, a search engine will generally return 
a list of ranked results that is ordered by a numeric relevance score. An entry in the result list 
will typically have a short description, a hyperlink to the identified document, and possibly a 
numerical score indicating a degree of conformity with the search query. Typically the user 
is then given the option of reordering the results by an attribute of the document, for example 
by the date of creation of each document. Such re-ordering is generally executed after a 
search operation. 

One problem with this approach is that users generally want to recover the most 
relevant and most recent information. Moreover, many users may only examine the first 
three items within a search results list. When sorting by keyword matches, there is the 
possibility that the earliest-listed search results may be out of date. And, similarly, when - 
sorting by date, the earliest-listed results may have poor keyword relevance to the topic being 
searched. 

Another document attribute or meta attribute of a web page is the number of incoming 
links to the web page. The number of incoming links may serve as a useful surrogate for the 
level of authority likely to be granted to the web page by those recovering the document in a 
search. The level of importance given to incoming links is usually fixed by the search 
engine. A potential problem here is that selecting for a high number of links to a web page 
may operate to favor older pages since such pages generally garner more links as time passes 
and more pages establish links to the web page at issue. 

Accordingly, it is a problem in the art that search engines generally provide a single 
rigid choice between alternative methods of assigning priority to search results. 

It is a further problem in the art that the importance assigned to the number of 
incoming links to a web page is generally fixed by prior art search engines. 
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It is a still further problem in the art that optimizing a web search while optimizing for 
a first characteristic may generate search results in need of further sorting in order to isolate 
search results satisfying one or more additional characteristics. 
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SUMMARY OF THE INVENTION 

The present invention is directed to a system and method for searching for documents 
identified in a database, wherein the method comprises the steps of establishing a first search 
criterion associated with a keyword match between a keyword entry and the identified 
documents, establishing at least one additional search criterion based on a document attribute 
of the identified documents, determining a criterion matching score for identified documents 
for each of the established search criteria, associating a scaling factor with each of the 
established search criteria, calculating an overall matching score for a selection of the 
identified documents from the criterion matching scores and scaling factors associated 
therewith, and ordering the selection of identified documents based upon the calculated 
overall matching scores. 
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BRIEF DESCRIPTION OF THE DRAWING 

FIGURE 1 depicts a sequence of steps for retrieving search results according to a 
preferred embodiment of the present invention; 

FIGURE 2 depicts a mechanism for adjusting scaling factors for document attributes 
according to a preferred embodiment of the present invention; 

FIGURE 3 depicts data entry to and output from a search engine according to a 
preferred embodiment of the present invention; 

FIGURE 4 is a display of search result scores helpful in determining an overall 
document rank according to a preferred embodiment of the present invention; 

FIGURE 5 depicts an exemplary search result ordered by keyword matching; 

FIGURE 6 depicts the earliest listed results of a search ordered by document date; 

FIGURE 7 depicts later listed results of the same search depicted in FIGURE 6; 

FIGURE 8 depicts a listing of results arising from a search conducted according to a 
preferred embodiment of the present invention; and 

FIGURE 9 depicts computer apparatus adaptable for use with a preferred embodiment 
of the present invention. 
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DETAILED DESCRIPTION 

The present invention is directed to a system and method which integrates a plurality 
of meta attributes or document characteristics along with a keyword search result into a 
search engine document relevance ranking. The inventive approach preferably allows a user 
to select a plurality of attributes to employ in evaluating documents in a search operation, the 
5 direction of the user's preference for each of the attributes (such as, whether the user is 

searching for older or newer documents), and the relative weight to be accorded each of the 
selected attributes. An overall rank or matching score is preferably calculated from the 
individual criterion matching scores generated by appropriately combining such individual * 
criterion matching scores. 

VG Herein, the terms meta data, meta attributes, and document attributes generally 

y ) correspond to characteristics of a document such as age, number of incoming links, and 

„X • readability, but generally do not refer to an extent of keyword matching between such 

:T document and a keyword search. Herein, the term "search criterion" generally corresponds to 

s a basis for prioritizing a selection of documents from a group of documents, which basis 

if: pertains to one of the above-discussed document or meta attributes and/or to an extent to 

which a document matches a keyword search term. A search criterion relating to a document 
t attribute preferably includes a document attribute query or document attribute search query. 

A scaling factor may be coupled with such query to indicate a relative weighting of the search 

criterion with respect to other search criteria forming part of the same search. For example, a 
20 search criterion relating to document age could be presented in the following form: 0.5 [Age: 

more recent], wherein 0.5 is the scaling factor, and the "more recent" is a query indicating a 

preference for more recent documents. 

Alternative document attribute queries may be expressed, such as, for instance, where 
a readability index varies between 0 and 100, a query could be expressed as [readability 
25 {30,50}], indicating that only documents in the range of 30 to 50 will match the query. 

Additional data may be included in the query to indicate a preference for documents with 
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readability indexes closer to one or another end of a stated range. Of course, one or more such 
ranges could be specified. 



Generally, each search criterion within a search pertains to a different document 
attribute with one search criterion generally associated with keyword searching (where 
keyword matching generally does not relate to a document attribute. However, one or more 
search factors or search variables used in a search may be associated with a single search 
criterion. For example, a single keyword search criterion could include search factors or 
search variables for different keywords. A first search factor could include a query for the 
word "snorkel" and a second search factor could include a query for the word "scuba." 

Search criteria for use in the present invention may include but are not limited to the 
number of word-matches identified for user-identified keywords in a document, the age of the 
document, the number of links leading to the document, the number of links within the 
document leading to other documents, the length of the document (as measured in words, 
sentences, pages, or paragraphs), the number of words per sentence, the number of words per 
paragraph, the language in which the document is written, and the readability of a document. 
Herein, the term "readability" or "intellectual grade" of a document generally corresponds to 
the educational requirement needed to comprehend the contents of such document, such as is 
measured by certain grammatical analysis programs including but not limited to: the Flesch 
readability index and the Fox index. Such an attribute may be helpful where a user wishes to 
find documents on a particular subject for a high school student and wishes to avoid 
retrieving documents requiring a Master's degree for full comprehension of its contents. 
Preferably, the readability criterion, where employed, may be employed to screen documents 
for a range of educational levels. Such readability index is preferably not limited to a one- 
dimensional measure of intellectual skill. For instance, the readability index could be 
established to screen for documents according to defined skill levels in different intellectual 
areas such as, for instance, mathematics, literacy in English, fluency in English or other 
language, and proficiency in a specialized field such as computer science. 
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In a preferred embodiment, the inventive approach enables a user to combine the 
user's search preferences with regard to keyword searching and one or more document 
attributes in a single search operation, thereby yielding search results which best satisfy the 
user's preferences. Where, for instance, the user wants documents having a substantial level 
5 of recency in addition to exhibiting a good match with user search terms, a result may be 
generated which provides an effective combination of web page recency and keyword 
matching rather than presenting a web page having either good keyword matching but which 
is too old, or a document which is very recent but which has a poor keyword match with the 
user's search terms. Moreover, users may modify the relative weightings desired for various 
10 search criteria in successive searches, if prior searches prove unsatisfactory. For example, 
Z : where one search retrieves results with sufficient keyword matches but with documents which 
K[ are too old, the user is preferably able to readily modify the search criteria to increase the 

value of document recency with respect to the value of keyword matching. A different search 
s £- result more accurately matching the user's preferences would preferably result. 

Therefore, it is an advantage of a preferred embodiment of the present invention that a 
p user may conduct a search for documents which simultaneously takes account of keyword 
r ■ matching and one or more document attributes. 

£j It is a further advantage of a preferred embodiment of the present invention that a user 

r may adjust the relative weighting of various search criteria employed to order the results of a 
20 document or web page search. 

It is a still further advantage of a preferred embodiment of the present invention that a 
user may vary the relative weighting of the search criteria in successive searches in order to 
optimize a search result. 

FIGURE 1 depicts a sequence of steps for retrieving search results according to a 
25 preferred embodiment of the present invention. The succeeding discussion of FIGURE 1 
presents a general discussion of the operation of the inventive search mechanism. A more 
detailed treatment of the calculation of matching scores is presented thereafter. 

#250001 57vl< 



Attorney Docket No. 10005531-1 PATENT 

9 

In a preferred embodiment, a user selects search criteria to be employed in searching 
for documents in step 101 . Herein, the documents being searched may be pages on the World 
Wide Web, but it will be appreciated that other types of electronic documents stored or 
identified by metadata on a wide range of other databases or storage devices may also be 
5 searched employing the mechanism of the present invention, and all such variations are 
included within the scope of the present invention. 

At step 102, a user preferably identifies the direction of the effect on the search result 
of each selected search criterion. For example, with respect to the "age" criterion, a user 
could indicate whether younger or older documents are preferred. 

Ifi Additionally or alternatively to assessing a document's met a data, the algorithms 

J presented herein may be applied to individual search terms or phrases. A user could indicate 
p : whether a document should be favored or disfavored based upon the presence of certain 
j : words or phrases therein. For example, a search for documents pertaining to a vacation 

involving snorkeling but not scuba diving could direct the inventive search engine to favor 
16 documents including the term "snorkeling" and to disfavor documents including the phrase . 
(fi "scuba diving." The user may specify the weight both of the terms to be favored and those to 
]Z be disfavored in an ensuing search. 

Zi In the prior art, a weight of terms to be favored or included may optionally be 

specified, but the weight of terms to be disfavored or excluded is generally not available. A 

20 limitation arising from the prior art omission of "exclusion-weighting" is that documents that 
could be considered good search results due to a high number or density of references to a 
favored term, such as "snorkeling," but which include as little as one reference to a disfavored 
term, such as "scuba diving," would be completely excluded from a generated search result, 
thereby denying the searcher a potentially desirable search result document. However, the 

25 inventive search engine preferably includes the ability to promote documents including many 
references to "snorkeling" while simultaneously including the ability to demote to varying 
degrees but not necessarily completely eliminating, documents including the term "scuba 
diving," Generally, the degree of promotion or demotion of a document is determined by 
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combining the values of various user search variable selections and the prevalence of 
identified terms or phrases in documents being evaluated as potential search results. 

In a preferred embodiment, overall search results are generally calculated based on a 
combination of search criterion matching scores associated with keyword matching and with 
one or more document attributes. Where more than one keyword match query is submitted, 
an overall keyword matching score is preferably calculated from a combination of matching 
scores associated with individual keyword queries. 

At step 103, the user preferably enters a weighting value, or scaling factor, to be 
applied to each search criterion by the search mechanism or search engine. Where the age of 
a document is only moderately important but matching of a keyword term is very important, 
scaling factors reflecting these respective weightings are preferably applied to matching 
scores reflecting the extent of a match between each searched document and the user's search 
criteria. A calculation method for implementing such scaling criteria is presented in detail 
elsewhere herein. It will be appreciated that such scaling factors may be applied to range of 
search criteria other than document age and keyword matching. 

At step 104, the inventive search engine preferably calculates matching scores for 
each criterion as applied to each searched document based on the extent to which the 
document matches such criterion. Such matching scores are preferably combined to calculate 
an overall document matching score, or overall matching score, for a document. 

At step 105, the inventive mechanism preferably generates an overall matching score 
for each searched document. This is preferably accomplished by multiplying the value of 
each criterion matching score by its associated scaling factor, squaring the product of each 
scaling factor-criterion matching score, summing the squares of the scaling factor-criterion 
matching score products, and taking the square root of this sum to determine the overall 
matching score for a particular document. This approach is shown in equation 1 below. It 
will be appreciated that other computational approaches could be employed to generate a 
single number representing the combined effect of the various scaling factors and criterion 



#25000157vl< 



Attorney Docket No. 1000553 1-1 PATENT 

11 

matching scores, and all such variations are included within the scope of the present 
invention. 

At step 106, the inventive search engine preferably orders documents according to the 
overall matching score for each examined document. The documents will be generally be 
listed in order of descending overall matching score. At step 107, the search engine 
preferably retrieves and displays the ordered documents for a user. 

FIGURE 2 depicts a mechanism for adjusting scaling factors for document attributes 
according to a preferred embodiment of the present invention. This mechanism may be a text 
box 201 to accept the keyword search and a plurality of user adjustable settings 202-204 for 
establishing the weighting, as embodied in a scaling factor, of each search criterion. FIGURE 
2 shows this arrangement for three search criteria, specifically, document age 202 , links 203 
(which may be incoming or outgoing), and readability 204 (or intellectual grade of the 
document). However, it will be appreciated that the inventive search engine could enable a - 
user to modify the weightings of any number of document attributes, such as for instance, 
document length, and all such variations are included within the scope of the present 
invention. 

In a preferred embodiment, a user operates interface 200 by entering keywords into 
text box 201 and/or adjusting selected ones of settings 202-204 (and/or other document 
attribute settings) to indicate the relative importance of the document attributes, and clicking 
the search button 205 to activate a search. In general, the weightings of the various document 
attributes are established relative to the weighting of the keyword match result whose scaling 
factor is generally set to a value of 1 . 

FIGURE 3 depicts data entry to and output from search engine 302 according to a 
preferred embodiment of the present invention. Preferably, user entry data 301 is input to 
search engine 302 which generates search results 303 which are sorted based on all document 
attributes as well as keyword match queries included in user entry data 301. 

In a preferred embodiment, search engine 302 generates a result list in ranked order 
determined by an overall matching score calculated according to equation 1, below. The 
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system may optionally store the user's preferences regarding the scaling factors so that upon 
return to the interface 200, the user does not have to readjust the positions of sliders 202-204. 

The following presents a preferred approach for determining the result list ranking. It 
is assumed that the results for each search criterion are orthogonal (independent of one 
another) and that the search criteria generate matching scores when applied to a document. 
These orthogonal matching scores then preferably generate a point in an n-dimensional 
space. For example, in FIGURE 4, there is a three dimensional space 400. Specifically, one 
dimension is the keyword match score 401, a second dimension is the age score 402, and a 
third dimension is links score 403. 

In a preferred embodiment, points from three dimensional space 400 may then be 
projected on to a one dimensional result list. It will be appreciated that, based on the number 
of search criteria entered by a user, space 400 may include fewer or more than three 
components or dimensions. 

A preferred approach to generating a one-dimensional result list is discussed herein. 
However, other approaches to generation of such a list will be apparent to those of skill in the 
art. Herein, a criterion matching score is preferably calculated from a criterion matching 
result and an associated origin offset. 

Preferably, the point distance from origin 404 to points 405-407 at the ends of result 
vectors 402-403, respectively, is the measure of the document relevance (or vector 
magnitude) for each of the selected criteria. Whether such document relevance operates to 
favor or disfavor a high ranking of the document generally depends upon the value selected 
for the origin offset. 

Preferably, a vector drawn from origin 404 to any of points 405-407 at the ends of 
result vectors 401-403, respectively, represents such vector's magnitude. This value is 
preferably combined with user-entered information to determine an overall matching score 
for a document. The user-entered information is preferably employed to determine origin 
offsets and scaling factors for each of the search criteria. 
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The following steps are preferably performed to calculate the overall rank or overall 
matching score for a document. First, the matching results for each search criterion are 
preferably normalized to (or linearly mapped into) a standard range so that the numbers 
associated with results from each of the search criteria may be meaningfully combined. 
Herein, the results for each criterion are preferably normalized to the numeric range {0,100}. 
However, it will be appreciated that any positive numerical range will enable operation of the 
inventive search engine so long as the numerical ranges are consistent for each user-selected 
search criterion. 

In a preferred embodiment, search results for a search criterion are normalized into a 
preferred numeric range by finding the highest and lowest numerical results associated with a 
particular search criterion and scaling the numerical gap between these highest and lowest " 
results to the preferred range, which may be user-selected. For example, where, for a 
particular search criterion, the lowest returned numerical result is 20 and the highest is 420, 
the numerical gap between the highest and lowest results is 400. A case where the user 
desires to use a numeric range of 0-100 is considered. In this instance, scaling a returned 
result to the 0-100 range would preferably involve subtracting 20 from the returned result (or 
search criterion matching result) and then dividing the resulting number by 4. In this manner, 
a result of 20 would yield a normalized result of 0, and a result of 420 would return a 
normalized result of 100. Thus, in this instance, the normalization offset is 20 and the 
normalization constant is 4. In this case, a search criterion matching result of 120 would 
yield a normalized value of (120-20)/4 = 25. It will be appreciated that in an alternative 
embodiment, the normalization operation could involve a range of different numerical 
operations including both linear and/or non-linear computations. 

The value of the overall matching score for a particular document may be calculated 
as follows: 

r i = VW k i + °k)) 2 + M a l + °a)) 2 + (siO, + O,)) 2 . . . (Eq. 1) 

wherein: 
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r s is preferably the calculated rank (or overall matching score) for the i* document. 
s k is preferably the keyword scaling factor. For the purpose of the instant discussion, the 
keyword scaling factor is assumed to have a value of "1." However, this scaling factor could 
be provided with any value in the range {-lowerlimit, 0, +upperlimit} . The optimum values 
of -lowerlimit and +upperlimit may be determined empirically, however in the preferred 
embodiment, the range is {-1 ,0,+l } . Alternatively, other ranges, both symmetric and 
asymmetric, may be used, such as, for instance, {-1.5,0,+1.5} or {-0.75,0,+1.5}. 

k ; is preferably the keyword matching result for the k document and is preferably in the range 
{0,100}. 

0 k is preferably the keyword origin offset. 0 k is preferably set to a value of 0 where multiple 
occurrences of the pertinent keyword favor a high ranking of the document and is preferably 
set to -100 when the search favors documents to an increasing degree with diminishing 
frequency of occurrence of the pertinent search term. 

s a is preferably the age scaling factor and is set to a value in the range 

{-lowerlimit,0,+upperlimit} as determined by the position of the adjustment setting 202. The 
optimum values of -lowerlimit and +upperlimit may be determined empirically. However, in 
a preferred embodiment, the range is {-1,0,+1}. Other ranges, both symmetric and 
asymmetric, may be used, such as, for instance, {-1.5,0,+1.5} or {-.75,0,+1.5}. 

When a user selects the "don't care" condition for any of settings 202-204 (FIGURE 
2), the value of the scaling factor associated with that setting is generally 0. Preferably, the 
relationship of the adjustment of setting 202 to the value of s a may be either linear or non- 
linear. A process of trial and error and/or analysis may be employed to determine an 
optimum relationship between the position of setting 202 and the value of s a for the purpose 
of optimizing the operation of the inventive search engine. 

3i is preferably the age score normalized to the range {0,100} . The age score is preferably 
determined by measuring the age of the document in a recognized chronological unit (such as 
days) and normalizing to a range of {0,100} using the following linear mapping function: 
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a i = 



100 / 

(d i - d 



d _ d V a i-<W (Eq.2) 



max mm 



where is preferably the age of the oldest document, d^ is preferably the age of 
the youngest document, and d is preferably the age of a document the attributes of which are 
currently under evaluation. Generally, the highest score will be awarded to the oldest 
document. If the user prefers recent documents, the resulting effect on the overall matching 
score may be modified via adjustment of the value of the age origin offset o a . 



°a = 



In a preferred embodiment, the value of o a is generally 0 where the user prefers older 
documents and -100 where the user prefers recent documents. It will appreciated that 
alternative numerical values for o a may be employed, and that all such variations are included 
within the scope of the present invention. 

In a preferred embodiment, 1, is the link score in the range {0,100}. The link score is 
preferably determined by counting the number of incoming links to the document and 
normalizing this count to a number within the range of 0-100 using the following linear 
mapping function: 

! i - ~ ~ l C i - C min J (Eq. 4) 

max min 



In a preferred embodiment, with regard to equation 4 above, c max is the link count for 
the document with the greatest number of incoming links, c min is the link count for the 
document with the fewest incoming links, and c, is the link count of the i* document (the 
document under consideration). Generally, the highest score will be awarded to documents 
with the most links. However, if the user prefers documents with fewer links, the resulting 
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effect on the overall matching score may be modified via adjustment of the value of the link 
origin offset Oj. 



In a preferred embodiment, the value of Oj is set to 0 for user selections where the user 
prefers documents with more links and to -100 where the user prefers documents with fewer 
links. In a preferred embodiment, Sj is the incoming link scaling factor. s 2 is preferably set to 
a value in the range {-Iowerlimit,0,upperlimit} as determined by the position of link count 
adjustment setting 203 (FIGURE 2). Generally, the optimum values of -lowerlimit and 
-Hipperlimit may be determined empirically. However, a preferred range is {-1,0,1}. Other 
ranges both symmetric and asymmetric may be used, such as for instance, {-1.5,0,1.5} or 
{-.75,0,1 .5} . Where a user selects the "don't care" condition for count adjustment 203, the 
value of the origin offset is 0. In a preferred embodiment, the relationship of count 
adjustment setting 203 to the value of S k may be linear or non-linear. 

In a preferred embodiment, the values of r { for searched documents are evaluated and 
the documents then ordered according to the r { values. Generally, the documents are 
presented in order of descending value of r { . 

FIGURE 5 depicts an exemplary search result 500 ordered by keyword matching 
score. In FIGURE 5, three options are presented for sorting search results: by score 501, by 
date 502, and by document type 503. It may be seen that "score" option 501 is selected. In 
the search result table, columns are provided indicating the score, type, date, and size of each 
document. 

Continuing with the example, it may be seen that under the score column heading 
504, the keyword scores of the listed documents begin at 70 for the first document 507, and 
diminish from there to 68, 66, and then 66 again, for results 507, 508, 509, and 510, 
respectively. While this approach effectively isolates documents presenting the best keyword 
matches with the entered keyword 511, the dates of the earliest-listed documents are scattered 




(Eq. 5) 
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over a substantial range of time. It is apparent that where a user desires to recover documents 
with good keyword matching and substantial recency of document creation, effort would 
generally have to be expended to locate the desired documents within a list of search results. 

FIGURE 6 depicts the earliest-listed results 600 of a search ordered by document date. 
It may be seen that in the search results 600 listed in FIGURE 6, the date option 502 is 
selected for sorting the documents. Under the date column heading 506, the results are 
shown listed in order of increasing age, with the newest document 601 having a date of 
November 22, 2000. Under the "score" column heading 504, it may be seen that the scores 
vary with no particular pattern among search results 601-605. 

Continuing with the example, and turning to FIGURE 7, a set of search results 700 
arising from the same search associated with FIGURE 6 is presented. It may be seen that 
results 708-710 have fairly high keyword scores of 63 and dates of May 18, 2000, thereby 
presenting an effective combination of document recency and keyword matching. However, 
the results listed in FIGURE 7 represent the third page of the search results for which the first 
page is shown in FIGURE 6. A user would generally have to manually look through a 
substantial number of search results, employing the search mechanism depicted in FIGURES 
6 and 7, in order to find desirable results 708-710, thereby expending valuable time and 
effort. 

FIGURE 8 depicts a listing of results 800 arising from a search conducted according 
to a preferred embodiment of the present invention. Column headings 801-805 point out 
keyword score, age in days, normalized age, adjusted relevance score, and document 
description respectively. 

The results in FIGURE 8 are ordered according to descending value of adjusted 
relevance score 804 according a preferred embodiment of the present invention. Adjusted 
relevance score 804 generally corresponds to the term "overall matching score" employed 
elsewhere herein. The adjusted relevance score 804 is calculated according to algorithms 
presented elsewhere herein in connection with the overall matching score, to effectively 
combine the recency and the extent of the keyword match for each document. In this manner, 
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the documents having the best combination of recency and keyword matching, according to 
user-supplied relative weighting of the two criteria, are presented at the top of the list instead 
of being randomly scattered throughout several pages of results. 

The consequence of combining the effects of keyword matching and document 
5 recency may be seen by examining documents 812 and 813. Document 812 has a relatively 
high keyword score of 68 and a low level of recency, being 1071 days old. Document 813 
has a relatively low keyword score of 47 and relatively high recency level, being only 26 days 
old. The adjusted relevance scores of the documents 812 and 813 are however quite close, at 
68.0 and 67.9, respectively. 

ID FIGURE 9 illustrates computer system 900 adaptable for use with a preferred 

Ju embodiment of the present invention. Central processing unit (CPU) 901 is coupled to 
£ ; system bus 902. CPU 901 may be any general purpose CPU, such as a Hewlett Packard 
H ; PA-8200. However, the present invention is not restricted by the architecture of CPU 901 as 

; long as CPU 901 supports the inventive operations as described herein. Bus 902 is coupled 
15 to random access memory (RAM) 903, which may be SRAM, DRAM, or SDRAM. ROM 

904 is also coupled to bus 902, which may be PROM, EPROM, or EEPROM. RAM 903 and 
ROM 904 hold user and system data and programs as is well known in the art. 

:. Bus 902 is also coupled to input/output (I/O) adapter 905, communications adapter 
card 911, user interface adapter 908, and display adapter 909. The I/O adapter 905 connects 

20 to storage devices 906, such as one or more of hard drive, CD drive, floppy disk drive, tape 
drive, to computer system 900. Communications adapter 91 1 is adapted to couple computer 
system 900 to network 912, which may be one or more of local area network (LAN), 
wide-area network (WAN), Ethernet or Internet network. User interface adapter 908 couples 
user input devices, such as keyboard 913 and pointing device 907, to computer system 900. 

25 Display adapter 909 is driven by CPU 901 to control the display on display device 910. 

In a preferred embodiment, user interface 200 is presented on display device 910. 
Information for entry into user interface 200 may be provided by one or more of keyboard 
913 and pointing device 907. Preferably, CPU 901 is employed to calculate various matching 
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scores discussed elsewhere herein. It will be appreciated that computer systems having 
configurations and components differing from that of computer system 900 may be employed 
in conjunction with the present invention, and all such variations are included within the 
scope of the present invention. 
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